Yuan Tian

Peter

Towards Comprehensive Benchmarking Infrastructure for LLMs In Software Engineering

Jan 28, 2026
Via arXiv

AgentIF-OneDay: A Task-level Instruction-Following Benchmark for General AI Agents in Daily Scenarios

Jan 28, 2026
Via arXiv

Automated Safety Benchmarking: A Multi-agent Pipeline for LVLMs

Jan 27, 2026
Via arXiv

Scalable Knee-Point Guided Activity Group Selection in Multi-Tree Genetic Programming for Dynamic Multi-Mode Project Scheduling

Jan 20, 2026
Via arXiv

CycleChart: A Unified Consistency-Based Learning Framework for Bidirectional Chart Understanding and Generation

Dec 22, 2025
Via arXiv

Flowing from Reasoning to Motion: Learning 3D Hand Trajectory Prediction from Egocentric Human Interaction Videos

Dec 18, 2025
Via arXiv

Embodied Image Compression

Dec 12, 2025
Via arXiv

MACEval: A Multi-Agent Continual Evaluation Network for Large Models

Nov 12, 2025
Via arXiv

GeoGen: A Two-stage Coarse-to-Fine Framework for Fine-grained Synthetic Location-based Social Network Trajectory Generation

Oct 09, 2025
Via arXiv

PEHRT: A Common Pipeline for Harmonizing Electronic Health Record data for Translational Research

Sep 10, 2025
Via arXiv